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Abstract. We perform a detailed forecast on how well a EuCLiD-like photometric galaxy 
and cosmic shear survey will be able to constrain the absolute neutrino mass scale. Adopting 
conservative assumptions about the survey specifications and assuming complete ignorance 
of the galaxy bias, we estimate that the minimum mass sum of m v — 0.06 eV in the normal 
hierarchy can be detected at 1.5a to 2.5a significance, depending on the model complexity, 
using a combination of galaxy and cosmic shear power spectrum measurements in conjunction 
with CMB temperature and polarisation observations from Planck. With better knowledge 
of the galaxy bias, the significance of the detection could potentially reach 5. 4a. Interestingly, 
neither PLANCK+shear nor PLANCK+galaxy alone can achieve this level of sensitivity; it is 
the combined effect of galaxy and cosmic shear power spectrum measurements that breaks 
the persistent degeneracies between the neutrino mass, the physical matter density, and the 
Hubble parameter. Notwithstanding this remarkable sensitivity to ^r?v, EuCLiD-like shear 
and galaxy data will not be sensitive to the exact mass spectrum of the neutrino sector; no 
significant bias (< la) in the parameter estimation is induced by fitting inaccurate models of 
the neutrino mass splittings to the mock data, nor does the goodness-of-fit of these models 
suffer any significant degradation relative to the true one (Axlg < 1). 
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1 Introduction 



The past decade has seen tremendous advances in the field of precision cosmology. Measure- 
ments of the cosmic microwave background (CMB) temperature and polarisation anisotropies, 
especially by the WMAP mission [1], have yielded information on the structure and content 
of the universe on length scales up to the current Hubble distance. Likewise, large-volume 
galaxy surveys such as the Sloan Digital Sky Survey (SDSS) [2], which has catalogued the red- 
shifts and angular positions of close to a million galaxies, provide a handle on the distribution 
of matter on intermediate scales. 

The coming decade is likely to be just as exciting. The Planck mission [3] will mea- 
sure the CMB anisotropies to an unprecedented precision, and new techniques designed to 
map out the structures in the low-redshift universe will reach maturity. Of these new tech- 
niques, the most promising is that based on the detection of cosmic shear, i.e., the weak 
gravitational lensing of distant galaxy images by the intervening large-scale structure. Cos- 
mic shear as a probe of the large-scale matter distribution has already been demonstrated 
to yield competitive cosmological constraints with the CFHTLS [4] and the COSMOS [5] 
surveys. Upcoming projects such as the Dark Energy Survey (DES) [6], the Large Synoptic 
Survey Telescope (LSST) [7], and Euclid [8] will take cosmic shear to a new level through 
an enormous increase in the survey volume. 

In this paper, we adopt the technical specifications projected for the photometric redshift 
survey of ESA's Euclid mission, which has been approved for launch in 2019, and focus 
specifically on the cosmological information contained therein. The Euclid photometric 
survey will be dedicated to the measurement of cosmic shear. At the same time, such a 
measurement will also contain the necessary information to enable the use of the galaxy 
data as a tracer of the underlying (angular) matter distribution. This dual use of the survey 
data turns out to be a great advantage for cosmological parameter estimation, especially 
for the measurement of the absolute neutrino mass scale. As we shall show, when used 
in combination, a EUCLID-Iike cosmic shear and galaxy distribution measurement has the 
potential to detect the sum of the neutrino masses ^ m u at better than 5cr significance, even 
if the true ^m,, is the minimum allowed by neutrino oscillation experiments (~ 0.06 eV). 
This is a significantly better performance than using either shear or galaxy distribution data 
alone. 

The paper is organised as follows. In section 2 we describe the angular power spectrum 
formalism which we use to quantify both the shear and the galaxy angular clustering two- 
point statistics. Section 3 discusses the statistical and systematic errors pertinent to the 
measurements, while in sections 4 and 5 we outline our mock data generation and forecasting 
procedures, generalising the Markov Chain Monte Carlo method introduced in reference [9] . 
Our results are presented in sections 6 and 7 for minimal and non-minimal cosmological 
models, respectively. We conclude in section 8. A detailed semi-analytic analysis of the 
parameter degeneracies in the cosmological probes considered can be found in the appendix. 

2 Euclid observables 

We focus in this work on the photometric galaxy and cosmic shear surveys, which will be 
one of the main data products from Euclid. For the two observables — the positions of 
galaxies and the corresponding shear in their images, we quantify their clustering statistics 
at lowest order in terms of an angular harmonic power spectrum. We opt for this measure 
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instead of the more familiar three-dimensional power spectrum (as in, e.g., [10]), because 
of the limits imposed by photometry. The reconstruction of the three-dimensional power 
spectrum requires an accurate measurement of the redshift space distortions along the line- 
of-sight. While this is well within the capacity of surveys that employ spectroscopy for redshift 
determination, photometry has too large an uncertainty to enable this measurement. The 
angular power spectrum was recently adopted in the analysis of the SDSS III photometric 
survey [11]. 

2.1 Angular power spectra 

From the two types of observables in our framework, the angular distributions of galaxies (g) 
and shear (s) on the sky, we can construct three distinct types of two-point statistics in 
harmonic space: the shear auto-spectrum C| s , the galaxy auto-spectrum C| g , and the shear- 
galaxy cross-spectrum C^ g , where the subscript i refers to the angular multipole. The shear 
observable can itself be decomposed into a curl-free e-mode and a divergence-free /3-mode. 
However, lensing by scalar metric perturbations at linear order does not produce /3-mode 
shear. Henceforth, the label "s" shall denote exclusively e-mode shear. We also consider the 
possibility that the shear and the galaxy angular distributions will be measured in several 
redshift intervals, and adopt the notation C?X to denote the angular auto- or cross-spectrum 
of the observables X, Y = s, g in the redshift bins i and j. 

Given a cosmological model, the angular power spectra can be computed from theory 
according to 

C% = 4tt J dlnkSX(k)Sl(k)Vn(k). (2.1) 

Here, Vn(k) is the dimensionless power spectrum of the primordial curvature perturba- 
tions TZ(k), and the source functions for shear and galaxies read 



Sh = -2 \ f {t2 l) } t + 2) I dxMkx)Wl(x)Ty(k, m - X ), (2.2) 



Sfi = / dx jt(k X ) Wf (X) T 5 (k, m - X ), (2.3) 



where the integration variable x is the comoving distance, tjq the conformal time today, 
je are the spherical Bessel functions, 1 and T^ : s are the transfer functions of the metric 
perturbations 2 * = (ip + (/>)/ 2 and the matter density fluctuation 5 respectively, defined 
via *!/(&, 77) = T^,(k,r])lZ(k) and 5(k,rj) = Tg(k,r])TZ(k). In ACDM-type cosmologies, the 
two transfer functions are related to one another at low redshifts via the Poisson equation 
k 2 T^(k,rj) = A7ra 2 (r])p m (r])Ts(k,r]) on subhorizon scales. 

The source functions (2.2) and (2.3) contain the window functions given by 

Wf(x)= rV^W), (2.4) 
J x x'x 

Wf(x) = b(k, X )ni(x), (2.5) 

where 

fnix) = H(z)hi(z) = H(z) dn/dz(z \ (2.6) 

m 



x For non-flat universes, x should be replaced with the comoving angular diameter distance and je with the 
ultra-spherical Bessel function [12]. 

2 We use the conformal Newtonian gauge whose line element is ds 2 = —a 2 (r/)[(l+2ip)dr] 2 + (l—2(f>)'yijdx' dx-*]. 



- 3 - 



encodes the redshift distribution of the source galaxies normalised to the surface density (i.e., 
number of galaxies per unit solid angle) 



dz' dn(z')/dz' (2.7) 

Azi 



in the redshift bin i, and dn(z)/dz is a survey-specific function to be specified in section 2.2. 
The galaxy window function Wf contains an additional dependence on the galaxy bias b(k, x), 
which relates the galaxy number density fluctuations to the underlying matter density per- 
turbations. 

Note that, at first glance, our definitions of the galaxy and shear power spectra in 
equations (2.1) to (2.3) appear to differ from their conventional forms found in the literature 
(see, e.g., [13]). This difference can be traced to the following: 

1. We have not implemented a priori the Limber approximation, which consists in sim- 
plifying the source function integrals (2.2) and (2.3) to 



l{p -!)(£ + 2) fir 



X =i/k 



7T 



2£k 2 



Wf(x)T s (k, V0 -x) 



(2.8) 
(2.9) 



X =i/k 

We do however use the approximation in our numerical computations at £ > 1000. 

2. We have opted to highlight the fact that shear probes the metric perturbations, while 
galaxy distribution probes the matter density fluctuations. While these quantities are 
correspondent under the Einstein equation in simple ACDM-type models, the distinc- 
tion may be important in modified gravity scenarios, as well as in models that predict 
dark energy clustering. 



3. The factor y 1 £{£ 2 — 1)(£ + 2)/4 in the shear source function arises from the fact that we 
are computing the e-mode shear without resorting to the flat-sky approximation [12]. 

For illustration, figures 1 and 2 show several sample spectra for our benchmark cosmological 
model (see section 5.1 for the cosmological parameter values). 

2.2 Source galaxy redshift distribution 

We model the redshift distribution of the source galaxies (2.6) using 

dn(z)/dz cx z 2 exp(-(z/z f), (2.10) 

where zq and /3 are parameters specific to the survey. This is a universal form for any 
magnitude-limited survey. The factor z 2 , together with the infinitesimal redshift element dz, 
associates a volume to a given solid angle, and the exponential cut-off truncates the dis- 
tribution beyond the distance at which a typical survey galaxy has an apparent magnitude 
equivalent to the limiting magnitude of the survey [14-16]. In our analysis we set j3 = 1 and 
choose zq = 0.3 so that we recover Euclid's projected median survey redshift of roughly 0.8. 
Note however that our final results (i.e., the estimated errors on various cosmological pa- 
rameters) do not depend strongly on the details of the source galaxy redshift model, as long 
as it follows qualitatively the shape described by the expression (2.10) and has the specified 
median redshift. 
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Figure 1. Angular power spectra for three redshift bins (bin 1: z £ [0,0.61], bin 2: z G [0.61,1.03], 
bin 3: z £ [1.03,3]). Thick solid lines are the bin-autocorrelation spectra, thin solid lines bin-cross- 
corrclations, both include HaloFit nonlinear corrections. The corresponding linear theory spectra are 
plotted as dashed lines. Top: cosmic shear angular power spectra. Bottom: galaxy angular power 
spectra. At low multipoles £, the galaxy signals in different redshift bins are anticorrclated. The 
(1, 3)-cross-spectrum is strongly suppressed because the corresponding bins are not adjoining, and is 
therefore not shown in the plot. 




3 Measurement errors 

Measurements of the galaxy and the shear angular power spectra CfJ- are subject first 
and foremost to cosmic variance, which poses an irreducible uncertainty of AC^/C^ = 
y/2/(2£ + 1) on how well the power spectra can be determined even in a perfectly con- 
trolled observation. Cosmic variance is folded into our analysis at the level of the likelihood 
function, to be discussed in section 5.2. Other sources of measurement errors need to be 
dealt with on a more fundamental level. We discuss these below. 
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Figure 2. Angular shear-galaxy cross-power spectra for two identical redshift bins in shear and 
galaxies (bin 1: z G [0,0.61], bin 2: z e [0.61,1.03]). The first number denotes the index of the 
shear redshift bin, the second the index of the galaxy bin. Note that unlike the ss and gg bin-cross- 
correlations, the sg cross-correlations are not symmetric in the redshift bin indices. Solid lines include 
HaloFit nonlinear corrections, while dashed lines are the corresponding linear theory spectra. 

3.1 Shot noise 

A finite surface density of source galaxies limits the largest multipoles at which can be 
accurately measured. This finite surface density appears as a shot-noise term 

6C nJse,ij = $ij5xY(Z?) 2 n- 2 , (3.1) 

which contributes to the observed angular power spectra in addition to the "signal" C?? . 
Here, E? = 1 for the galaxy power spectrum measurement, E^ = (7 2 } 1 / 2 corresponds to 
the root-mean-square ellipticity of the source galaxies for cosmic shear ((7 2 } 1/2 ~ 0.16 for 
Euclid [8]), and the Kronecker deltas <5jj and 5xy ensure that the noise term contributes 
only to autocorrelation spectra. The source galaxy surface densities rii in the redshift bin i 
can be computed as per equation (2.7), under the condition that Yli n % corresponds to the 
total galaxy surface density for the survey in question: for Euclid, this is anticipated to be 
40 galaxies/arcmin 2 [8]. 

3.2 Photometric redshift uncertainties 

The Euclid photometric survey does not have direct redshift information other than through 
filters. However, a combination of the spectroscopic survey and ground-based near-infrared 
photometry will be used to calibrate the photometric redshift measurements, and a precision 
of <j{z) ~ 0.03(1 + z) has been shown to be achievable [8]. Because Euclid will not be able to 
measure in the near-infrared, the addition of ground-based observations will also be crucial 
for minimising systematic biasing of the photometric redshift estimates [17]. 

We model the photometric redshift uncertainty as a simple Gaussian error of standard 
deviation <r(z) and with no bias. This error can be folded into the window functions as 
described in reference [18]. While cr(z) of the magnitude expected for Euclid is not critical 
for parameter estimation in ACDM-type models, it can lead to a significant degradation in 
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the survey's sensitivity to, for example, a time-dependent equation of state parameter in 
dynamical dark energy models [18]. 

In principle, redshift space distortions due to the collapse of large structures further 
affect the window functions. However, since these are of order 10 3 km s _1 , corresponding to 
5z ~ 0.003, they will be subdominant to the effect of the photometric redshift error at all 
but the smallest redshifts. We therefore ignore them in the following. 

3.3 Shear-specific systematic errors 

Shear measurement errors An additional source of error comes from the experimental 
extraction of shear from the galaxy images. For example, surveys like Euclid must be 
calibrated against large-scale simulation templates, and errors in the calibration result in an 
overall multiplicative error in the shear estimate. Spurious contribution to the shear arises 
also from, e.g., an asymmetric point spread function, resulting in a so-called additive error. 
A large and concerted effort is currently ongoing to quantify the magnitude of multiplicative 
and additive systematics, and to devise ways to correct for them (see, e.g., [19-22]). While 
it is currently not obvious that the bias introduced by multiplicative and additive errors is 
correctable to the level where it is no longer a limiting factor for the Euclid survey [22], we 
will assume that the correction is possible, and ignore these systematics in our analysis. 

Intrinsic alignment Intrinsic alignment refers to the spurious shear correlation signal that 
arises (i) when the source galaxies are close in physical space and their intrinsic ellipticities 
(i.e., their orientations) becomes aligned by the tidal forces of the gravitational field, and 
(ii) when the tidal-force field experienced by one source galaxy also causes the lensing of the 
images of higher-redshift galaxies at close projected distances, thereby leading to an alignment 
between galaxy images at different redshifts. See, e.g., [23-26] for detailed discussions of 
intrinsic alignment. 

Because intrinsic alignment affects those galaxies that are close in either physical or 
projected distance, for a fixed multipole £ we expect its contribution to the signal to increase 
with decreasing redshifts. For this reason, one may contemplate the exclusion of redshift 
bins below a certain threshold from the analysis. However, because of the strong correlation 
with neighbouring bins, the very lowest redshift bins for shear are in most cases not crucial 
for parameter estimation, except perhaps in models that evolve rapidly at very late times 
(as might be the case in models with a time-dependent dark energy equation of state). The 
removal of intrinsic alignment effects in the Euclid data also appears to be possible (see, 
e.g., [27, 28]). We therefore do not cut the analysis at low redshifts, but rather keep the 
minimum z at zero. 

4 Mock data generation 

Generalising the approach of reference [9], we construct mock galaxy and shear angular auto- 
and cross-correlation spectra based on the specifications and error expectations of the Euclid 
survey. We describe the details of the construction below. The synthetic data sets to be used 
in our parameter error forecast analysis are summarised in section 4.5. 
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4.1 Mock angular power spectra 

For a full-sky observation, a fully random mock data synthesis requires that we pick a fiducial 
cosmological model, and compute for it the total theoretical covariance matrix 

Q^j = + 5C noisej ij, (4.1) 

where is now understood to include all error contributions discussed in section 3 except 
shot noise, which is encoded separately in SC^J se f -. A random realisation of the observables 
a fm ODe yi n g can then be obtained following the multivariate random number generation 

procedure described in [9, 29], such that Cf^ = {21 + l)^ 1 Ylm=-£ a fm* a Jm & ves the mock 
angular power spectra. The procedure is repeated multiple times to generate an ensemble of 
realisations. 

However, for our purpose of estimating Euclid's sensitivity to various cosmological 
parameters (i.e., finding the size of the error bars), it is not necessary to generate a full 
ensemble of mock data; it suffices to set the mock angular power spectra Cf^ to be equal 
to the total covariance matrix C?? of the fiducial model, and hence perform the parameter 
estimation directly with the fiducial angular power spectra themselves. 

4.2 Partial sky coverage 

In principle, the above mock data generation procedure applies only in the case of a full-sky 
observation. The Euclid photometric survey, however, will cover only some 15,000 square 
degrees of the sky, equivalent to an effective sky fraction of / s k y = 0.3636. Any realistic survey 
that does not have full sky coverage necessarily yields angular power spectra that are 
correlated between ^-values because of any incomplete (9, </>)-basis on which to perform the 
spherical harmonic decomposition. 

The precise correlations between the multipoles will depend on the survey masks to be 
used. However, for surveys such as the Euclid photometric survey which have a reasonably 
symmetric mask (mainly due to a cut at low galactic latitudes), the loss of information due 
to the correlations between multipoles can be modelled approximately by augmenting the 
cosmic variance uncertainty as ACe — > \J\j /sky ACe. This does not affect the mock data 
generation procedure, but will have to be implemented in the parameter inference analysis 
at the level of the likelihood function (see section 5.2). 

4.3 Redshift binning 

Although our source galaxy redshift model (2.10) contains a built-in exponential cut-off at 
high redshifts, in practice we limit our analysis to the redshift interval z € [0, 3] because 
Euclid will observe very few galaxies beyond z ~ 3. This happens both because of the 
magnitude limitations of the survey and because the peak emission wavelengths are shifted 
out of the detectable region [8] . 

A simple redshift binning scheme would be to slice the complete redshift range into 
bins of equal widths Az. However, because dn(z)/dz is not a constant in z, such a binning 
scheme would lead to unequal galaxy surface densities and hence disparate noise properties 
(see section 3.1) in different redshift bins. Here, we adopt a different binning scheme, and 
choose our redshift slices in such a way that all bins contain the same rij. Compared with a 
constant-Az binning, this scheme leads to narrower bins around the peak of dn{z)/dz, and 
wider ones in the tails. By requiring the same noise contributions in all bins, we have thus 
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Figure 3. Two examples illustrating our rcdshift binning scheme and the corresponding redshift- 
dependent for mock galaxy data. Black lines denote eight redshift bins, while red lines represent a 
two-bin scheme. Solid lines correspond to e n i = 1, and dashed lines represent the linear-only case with 
e n ] = 0.1. Also plotted in gold is the differential source galaxy surface density from equation (2.10), 
with an arbitrary normalisation. 

avoided having "useless" bins that are swamped by noise. Additionally, this method is not 
very sensitive to the choice of the high-redshift cut-off, as long as it is far in the tail of the 
source galaxy redshift distribution. 

Figure 3 shows two examples of our equal rtj redshift binning scheme, one for eight bins 
and one for two bins. 

4.4 Nonlinearity, scale-dependent bias, and the maximum angular harmonic 

The contribution of nonlinearities on small scales is an important issue for cosmological 
parameter inference from large-scale structure observations. At the most basic level, the 
breakdown of linear perturbation theory as a description of the gravitational clustering of 
(dark) matter at large wavenumbers has observational consequences for all large-scale struc- 
ture surveys. This kind of nonlinearity is generally calculable by way of A^-body simulations, 
at least on the mildly nonlinear scales. At a more advanced level, observational probes that 
make use of tracers such as galaxies suffer in addition the presence of a scale-dependent 
bias, which is a considerably more severe form of nonlinearities because it cannot be pre- 
dicted from first principles. Measures need to be taken to ensure that we account for these 
nonlinear effects properly in our analysis. 

Shear power spectrum Cosmic shear probes directly the metric perturbations, so that 
tracer bias is not a concern. The main source of nonlinearities on the observable scales 
is nonlinear gravitational effects, although dissipative baryon processes could conceivably 
contribute a few percent to the power spectrum at £ > 2000 — > 3000. At very high multipoles 
non-Gaussian effects from nonlinearities also become important, and an analysis in terms of 
the power spectrum breaks down. However, based on ray tracing, reference [30] finds this 
not to be a dominant effect unless the analysis is taken to beyond I ~ 10 . 
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In our analysis, we impose a redshift-independent cut-off at ^ ax = 2000. The specific 
value is unimportant, because for a EuCLiD-like survey, shot noise always dominates over 
the signal before this cut-off is reached. Although this relatively low ^ ax precludes any 
strongly nonlinear data from entering into our analysis, it is nonetheless important that we 
correct for nonlinearities on the mildly nonlinear scales. In practice this means that when 
constructing our mock shear data, we compute our fiducial shear power spectrum assuming 
the nonlinear corrections of HaloFit [31], including the recent recalibration of reference [32]. 
We take the view that these nonlinear corrections are calculable to the accuracy required 
to maintain an unbiased estimation of the cosmological parameters (or will be by the time 
real Euclid measurements become available), and analyse the mock shear data assuming 
HaloFit provides the correct nonlinear modelling. See, e.g., [33] for a recent study of the 
accuracy requirement on the nonlinear matter power spectrum in the context of dark energy 
parameter determination. 

Galaxy power spectrum Parameter inference from, e.g., the three-dimensional power 
spectrum of the SDSS LRG galaxies usually require that we cut off the spectrum at a 
wavenumber of k ~ 0.1 — > 0.2 h Mpc" 1 in order to limit the effect of scale-dependent galaxy 
bias. Alternatively, one may fold in the scale dependence in a bias model and marginalise 
over the model's nuisance parameters, as was done in, e.g., reference [34] using the P-model, 
and reference [10] using the Q-model. 

Here, we tackle the problem of scale-dependent bias by introducing in the galaxy angular 
power spectrum a redshift bin-dependent cut-off ^m* ax . Our procedure is motivated by two 
premises. Firstly, the sensitivity of the measurement (which, at large scales, is limited by 
cosmic variance) determines the required accuracy of the theoretical prediction. Secondly, 
the relative nonlinear correction to the matter power spectrum serves as an indicator of the 
scale at which nonlinear biasing effects start to become relevant. We therefore define 4iiax + 1 
as the smallest t value that satisfies the inequality 



where e n i is a nonlinear tolerance factor, Zi the mean redshift of bin i, V\\ n the linear matter 
power spectrum, and V n \ the nonlinear matter power spectrum corrected with HaloFit. We 
consider two choices of the tolerance factor e n \: 

• e n i = 0.1; this "linear only" scenario completely eschews any remotely nonlinear region. 

• e n i = 1; this slightly more optimistic case requires a modest, but not unrealistic amount 
of nonlinear modelling, and is our default choice. 

Generally, the higher the redshift, the smaller the nonlinear correction for a given length 
scale. At the same time, however, going to higher redshifts also changes the projection; the 
same scale is now projected to a higher £, where cosmic variance is less so that the demand for 
accuracy of the power spectrum higher. In practice, though, the wavenumber corresponding 
to ^max does not depend strongly on redshift; we find ~ 0.14 -> 0.16 Mpc" 1 in the default 
case, and ~ 0.06 — > 0.09 Mpc -1 in the linear only scenario. 

Shear-galaxy cross-correlation As discussed above, the scale-dependent bias for the 
galaxy auto-spectrum very quickly erases information at wavenumbers k larger than a cer- 
tain threshold. Shear-galaxy cross-correlation is in principle slightly less sensitive to this 




(4.2) 
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effect, since galaxy data contribute only to half of the measure. However, the corresponding 
threshold in /c-space turns out to be pushed back only by a small amount. To see this, take 
as an example the P-model introduced in [34] , which describes the scale-dependent bias as a 
super- Poisson shot noise, i.e., a component with no ^-dependence. On scales where the bias 
contribution is important, the power spectrum typically drops as k n , with n < —1.5. There- 
fore, if scale-dependent bias corrections become relevant for the galaxy auto-spectrum at febiasj 
then the corresponding threshold for the cross-spectrum should be at 2 _1 / n £;bias — 1-5 ^biasj 
which is a marginal shift. For this reason, we take for simplicity the same £ mSuX for the 
cross-correlation as for the galaxy auto-correlation. 

4.5 Synthetic Euclid data 

We summarise here the mock data sets we generate for the Euclid photometric survey. 

• A weak lensing shear auto-spectrum, ■, where the multipole I runs from 2 to 1^ = 
2000 independently of redshift. The indices i,j € [1,A S ] label the redshift bin, where 
the redshift slicing follows the scheme described in section 4.3. 

• A galaxy auto-spectrum, Cff ■, where the multipole runs from 2 to and the indices 

i, j € [l,iVg] label the redshift bin. The maximum multipole tmax is redshift-dependent 
because the scale at which scale-dependent galaxy bias becomes important varies with 
redshift, as discussed in section 4.4. 

• A shear-galaxy cross-spectrum, C^j -, where i 6 [1, N s ] labels the shear redshift bin and 
j € [1, Ng] the galaxy redshift bin. For the cross-spectrum, the maximum t values are 
always given by £|ii x of the galaxy redshift bin j. 

5 Forecasting 

Having outlined the mock data generation procedure, we are now in a position to describe 
our parameter error forecast for the Euclid photometric survey. The forecast is based on 
the likelihood method, in which we construct a likelihood function of the mock data and then 
explore its implications using Bayesian inference techniques. 

5.1 Model parameter space 

As a simple example, we consider a ACDM model extended with a nonzero neutrino mass 
specified by seven parameters: 

6 = (uj h ,uj dm ,h,A s ,n s ,z rc ,f u ), (5.1) 

where Ub = f^/i 2 is the physical baryon density, ujdm = ^dmh 2 the physical dark matter (cold 
dark matter and massive neutrinos) density, h = Hq/(W0 km s" 1 Mpc -1 ) the dimensionless 
Hubble parameter, ^4 S the amplitude of primordial scalar fluctuations, n s the associated scalar 
spectral index, z Te the redshift of reionisation, and the neutrino density fraction is defined 
here as f u = uj u /uj dm . Our fiducial model is defined by the parameter values 

Gfid = (0.022, 0.1126228, 0.7, 2.1 x 10~ 9 , 0.96, 11, 0.00553), (5.2) 

which we use to generate the mock data sets described in section 4.5. 
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The fiducial neutrino fraction /„ = 0.00553 corresponds to a neutrino energy density 
ijj v = 0.0006228, or a sum of neutrino masses ^ m v = 94.1 u u eV ~ 0.06 eV, roughly what one 
expects for a normal neutrino mass hierarchy in which the lightest mass eigenstate has a zero 
mass mi ~ 0. We implement this fiducial neutrino energy density as being contained entirely 
in one single neutrino species with mass ~ 0.06 eV, the other 2.046 species being exactly 
massless. While this is formally inconsistent with the mass spectrum of the normal hierarchy, 
namely, m,2 = mi + \J AraJ 2 — 0.009 eV and 711,3 = m i + \/ Am'f 2 + ^Am^ — 0.05 eV, 3 our 
simplified model is observationally indistinguishable from the correct one unless the survey 
sensitivity is much better than the mass splittings (see section 6.6 and also reference [37]). 
Except in section 6.6, the following parameter estimation analysis will also assume only one 
massive neutrino and 2.046 massless neutrinos. 

5.2 Likelihood function 

Because we are concerned only with parameter error estimation, not model comparison, we 
may define our likelihood function C in such a way that x%i = — 21n>C is exactly equal to 
zero for the fiducial parameter values. If the covariance matrix of the data set has dimen- 
sion N x N, this amounts to defining Xeg as 



where Cp = Cy-r, is the mock data covariance matrix, while Cp = CyC: is the total theoretical 
covariance matrix of equation (4.1). 

From the three mock Euclid data sets we generate, four different combinations are 
possible: 

• Galaxy auto-spectrum only, N = N g . 

• Shear auto-spectrum only, N = N s . 

• Galaxy and shear auto-spectra, with no shear-galaxy cross-spectrum. In this case 
Xeff = Xeff g + Xeff s' wnere the two contributions have N = N s and N = N s respectively. 

• All galaxy and shear power spectra including shear-galaxy cross-correlation, N = N g + 



5.3 Parameter inference 

We use the nested sampling algorithm [38], as implemented in the MultiNest [39, 40] add-on 
to the CosmoMC [41] parameter estimation code, to infer constraints on the parameters of our 
cosmological model (5.1). 

Because the full model parameter space is quite large and some of the parameters, espe- 
cially a;;, and z Te , are badly or completely unconstrained by large-scale structure observations, 
we use in addition a set of mock CMB TT, EE and TE data simulated following the pro- 
cedure of [9] according to the specifications of Planck [3], using information from the 100, 
143 and 217 GHz channels. We do not model foreground signals, beam uncertainties, etc., 
but instead cut off the spectra at a maximum multipole of ^aL B = 2000 in both temperature 
and polarisation. 

3 We use Am? 2 = 7.5 x 10" 5 eV 2 and |Aml 3 | = 2.3 x 10~ 3 eV 2 [35, 36]. 




(5.3) 
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5.4 Treatment of galaxy bias 

While we have adopted a very conservative cut in the high multipoles of the galaxy angular 
power spectra in order to avoid the issue of scale-dependent galaxy bias, the galaxy number 
density fluctuations still differ from the underlying matter density perturbations by a redshift- 
dependent linear bias factor. 

A realistic linear bias model should be a continuous and monotonic function of redshift, 
one that is probably describable in terms of only two or three free parameters. In the absence 
of a concrete model, however, we consider two extreme assumptions about the linear bias 
when analysing our mock galaxy data: 

1. In the most optimistic scenario, the linear bias factor is perfectly known, so that the 
galaxy power spectrum measures directly the underlying matter power spectrum with 
no loss of information. In this case, without loss of generality, we can set the linear 
bias factor in every redshift bin to hi = 1. 

2. In the most pessimistic scenario, we have no knowledge of the linear bias at all, which 
makes it necessary to postulate N g independent linear bias factors, one for each of the 
iVg redshift bins. These iVg linear bias parameters are let to vary in our parameter 
inference, and marginalised at the end of the analysis. Under this assumption, not only 
do we discard information coming from the normalisation of the galaxy density fluctu- 
ations, we also throw away information about the growth of structures as a function of 
redshift. 

6 Results 

As discussed above, given the survey specifications, there are a number of ways in which we 
may construct our mock data sets and subsequently analyse them. For both cosmic shear 
and galaxies, we have the freedom to select the subdivision of the data according to redshift. 
For galaxies, we additionally need to specify a bin-dependent £m ax and how to deal with the 
galaxy bias. The choices we make will eventually affect the parameter constraints derived 
from the data. 

We begin this section by assessing the impact of our assumptions on cosmological pa- 
rameter inference from the individual shear and galaxy mock data sets together with mock 
Planck CMB data. After that, we consider all data sets in combination, and determine the 
best parameter sensitivities attainable by Euclid. 

6.1 Varying the number of redshift bins 

Given our assumptions of an unbiased redshift determination and a known photometric 
redshift uncertainty, one might expect that increasing the number of redshift bins Abm will 
always lead to smaller uncertainties in parameter estimates, until the bin widths become com- 
parable to the photometric redshift error, or the galaxy surface density too small to maintain 
a sizeable signal-to-noise ratio. In reality, however, the gain in information on cosmological 
parameters from redshift binning often saturates well before this point is reached. This is 
understandable for parameters that have a negligible effect on the growth of structures as a 
function of redshift, e.g., the parameters that describe the primordial perturbations. How- 
ever, because the growth function evolves slowly and monotonically with redshift in most 
reasonable cosmologies, signals in neighbouring redshift bins can become highly correlated, 
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Figure 4. Standard deviation of the marginalised posterior distribution for Wd-m (t°P left), h (top 
right) and J^ 171 ^ (bottom), derived for various combinations of synthetic Planck CMB data and 
Euclid data. Red: PLANCK+shcar auto-spectrum. Black/Grey: PLANCK+galaxy auto-spectrum 
without /with marginalisation over A/bin bias parameters. Dotted lines denote galaxy data on linear 
scales only (e n i = 0.1), while solid lines include data on mildly nonlinear scales (e n i = 1). 



so that even parameters that do have a strong impact on the growth function may be subject 
to the said saturation effect. 

Figure 4 shows how the standard deviation of the marginalised posterior probability dis- 
tribution for Wdm, h, and ^ m u evolves as a function of the number of bins, using Planck 
CMB data combined with either shear or galaxy measurements, sliced in redshift bins. 
The remaining four parameters of our seven-parameter model are almost completely con- 
strained by Planck ; adding shear or galaxy data to the inference leads only to a small 
reduction (up to ~ 20%) in the error bars. 

For PLANCK+shear, saturation occurs already at two bins. The improvement gained 
from going from one bin to two bins can be attributed to the fact that the latter improves 
the constraints on and h through a measurement of the growth function, even if in a 
convoluted form, at two different redshifts. (In ACDM-type cosmologies, the growth function 
is governed only by oj m = Wdm + and h; see equation (A. 9).) But a finer redshift binning 
adds little value to parameter inference in our simple seven-parameter model, because more 
bins would merely lead to an overdetermination of the two parameters that control the growth 
function. For more complex model spaces, however, especially those with dynamical dark 
energy described by multiple parameters, employing three or more tomography bins may lead 
to yet more improvement in the parameter constraints (e.g., [18]). Note that our forecast 
sensitivities to ^ m u for PLANCK+shear are somewhat better than the findings of [42]. This 
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Table 1. Posterior standard deviations for the parameters Wdm, h, and J^m^, derived from various 
combinations of data sets. Here, "c" denotes Planck CMB data, "g" galaxy auto-spectrum (11 bins), 
"s" shear auto-spectrum (2 bins), and "x" shear-galaxy cross-correlation. The subscripts "1" and "b" 
refer respectively to a galaxy data set with e n \ =0.1, and full bias marginalisation. 



Data 10 a x o-(ajdm) 100 x a(h) ajY.m^JeV 



c 


2.02 


1.427 


0.143 


cs 


0.423 


0.295 


0.025 


eg 


0.583 


0.317 


0.016 


cgi 


0.828 


0.448 


0.019 


Cgb 


0.723 


0.488 


0.039 


Cgbl 


1.165 


0.780 


0.059 


csg 


0.201 


0.083 


0.011 


csgx 


0.181 


0.071 


0.011 


CSgb 


0.385 


0.268 


0.023 


csg b x 


0.354 


0.244 


0.022 



is due to a larger model parameter space and lower assumptions about the survey performance 
adopted in the earlier analysis. 

For Planck combined with the galaxy auto-spectrum, there continues to be some 
improvement up to around ten bins. This is because a finer redshift binning in conjunction 
with our prescription for constructing the bin-dependent 4n ax allows us to measure the power 
spectrum up to higher multipoles (where cosmic variance is smaller) at high redshifts (see 
figure 3). In turn, a higher ^m ax lends a longer lever arm to probe the shape of the matter 
power spectrum and its small-scale amplitude. 

6.2 Bias modelling and small-scale cut-off 

As expected, bias marginalisation greatly degrades the galaxy auto-spectrum's sensitivity to 
cosmological parameters. In the case of ^2m u , for example, figure 4 shows a factor of two 
to three increase in the uncertainty. This suggests that for the low fiducial neutrino masses 
considered in this work, the full galaxy power spectrum, including normalisation information, 
constrains ^ m„ predominantly through a measurement of the spectrum's absolute amplitude 
on small scales, rather than through a detection of the characteristic step-like feature at the 
neutrino free-streaming scale, or the impact of ^2 rn y on the growth function. 

Further evidence to support this point can be seen in figure 4, when we discard the 
mildly nonlinear scales in the galaxy angular power spectrum in our analysis, i.e., we decrease 
e n i from 1 to 0.1. If the galaxy bias is exactly known, the degradation in the parameter 
uncertainties is about 30 to 40% for and h, and always less than 20% for ^ m„, with the 
loss of sensitivity being more pronounced for finer redshift binnings. However, if the linear 
galaxy bias is unknown, then the additional loss of information on the mildly nonlinear scales 
has a more dramatic effect on the parameter constraints. For ]P m u , we find that between no 
nonlinear information and with nonlinear information, the reduction in sensitivity is about 
40 to 60%. This tells us that the extended "lever arm" afforded by the mildly nonlinear scales 
is more important for the power spectrum shape measurement than for the determination of 
the small-scale spectrum amplitude. 
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Figure 5. Forecast two-dimensional joint marginalised 68%- and 95%-crcdible contours in the 
(cj m , h, ^2 m,/)-subspace. Black: Planck CMB data+galaxy auto-spectrum (11 bins) without bias 
marginalisation. Red: Planck CMB+shear auto-spectrum (2 bins). Green: PLANCK+galaxy auto- 
spectrum+shear auto-spectrum+shear-galaxy cross-correlation. 

6.3 Combining data sets 

Having studied the parameter sensitivities of CMB combined with either shear or galaxy 
data, we now proceed to consider shear and galaxy measurements in combination. Based on 
the conclusions of section 6.1, we choose to perform the combined analysis using two shear 
and eleven galaxy redshift bins. The results of this analysis are summarised in table 1. 

Somewhat surprisingly, we find that as long as the galaxy bias is known, constraints on 
Wdm; h, an d ^2 m u in the combined analysis are dramatically better than what one would 
naively expect from looking at the sensitivities derived from the PLANCK+shear ("cs") and 
PLANCK+galaxies ("eg") analyses. Most notably, a determination of ^2m u with an error 
as small as a(^2m u ) = 0.011 eV may be possible combining Planck, shear and galaxy 
data ("csg"). Such a sensitivity would imply a detection of a nonzero neutrino mass at 
greater than five standard deviations, even in the most pessimistic case of a massless lightest 
state in the normal mass hierarchy. Galaxy bias marginalisation reduces the sensitivity to 
o-(J2 m u) = 0.023 eV ("csgb"), which still represents a ~ 2.5a detection in the worst case. 

A first hint at an explanation for this dramatic improvement in the sensitivities can be 
gleaned from examining the joint two-dimensional 68%- and 95%-credible contours in the 
(%, h, ^2 m^)-subspace of our seven-dimensional model parameter space (the other param- 
eter directions are essentially decoupled). These are shown in figure 5. Already, we can 
see from these two-dimensional results that the degeneracy directions in the (u m , h, Y2 m u )- 
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Figure 6. Three-dimensional scatter plots of samples with xts — ^ from the Markov chains of 
our three-parameter fits. Black: Galaxy auto-spectrum only. Orange: Planck CMB data+shear 
auto-spectrum. 

subspace are somewhat different for PLANCK+shear and for PLANCK+galaxies, so that in 
combination they break each other's degeneracies. 

A better illustration of this degeneracy-breaking can be found in figure 6. Here, we show 
a three-dimensional scatter plot of the Markov chain samples with XeS — 2 from a three- 
parameter fit of the PLANCK+shear data and the galaxy-only data, varying in both cases 
only (w m , h, Tn u ). When viewed from a suitable perspective, it is clear from the scatter plot 
that although the region of parameter space preferred by the galaxy-only data is generally 
much larger than that favoured by PLANCK+shear, the two regions in fact have very little 
overlap, and cut into each other in a way that improves significantly the constraints on all 
three parameters. 

For a more quantitative, semi-analytic discussion of the degeneracy directions preferred 
by the different data sets in the (w m , h)- and (w m , h, ^ m^)-subspaces, we refer the reader to 
appendix A. 

6.4 The effect of shear-galaxy cross-correlation 

As we saw earlier in figure 2, shear and galaxy clustering are anti-correlated with one another. 
Thus, one might reasonably expect the addition of shear-galaxy cross-correlation to our com- 
bined analysis to yield even tighter parameter constraints, especially because cross-correlation 
is not subject to shot noise (although it is of course still subject to cosmic variance). 

However, as shown in table 1, the inclusion of shear-galaxy cross-correlation improves 
the parameter uncertainties only marginally. Comparing the "csgx" and the "csg" fits (and 
also "csgbx" and "csgb"), the improvement gained from shear-galaxy cross-correlation is of 
order 10%. One reason for this modest gain is that only for very few combinations of shear and 
galaxy redshift bins is the cross-correlation actually significant, e.g., low-redshift shear will 
almost be uncorrelated with the high-redshift galaxy clustering. Another reason is that over 
most of the range of multipoles we analyse, both the galaxy and the shear auto-spectra are 
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already limited only by cosmic variance; there is thus little to gain from the cross-correlation 
being noise-free. 



6.5 The role of linear spectral shape evolution 

One distinctive signature of massive neutrinos is the scale-dependence of the linear growth 
function G hn (k,z) induced by neutrino free-streaming. This scale-dependence means that 
/c-modes generally grow at the different rates, so that the shape of the linear 3D matter 
power spectrum, especially the degree of power suppression at large k due to non-clustering 
neutrinos, effectively changes with time. However, this spectral shape evolution is typically 
a much smaller effect than the actual free-streaming suppression itself [43], and until we can 
measure the suppression with high significance, it will most likely remain unobservable. To 
this end, in view of the extraordinary sensitivity of a EuCLiD-like survey to the neutrino 
mass, we would like to reexamine the detectability of linear spectral shape evolution in this 
section. 

We consider three fiducial neutrino mass sums ^2m u : (i) the default value of 0.058 eV 
used so far in our analysis, (ii) 0.11 eV, corresponding to the minimum mass of the inverted 
hierarchy (see also section 6.6), and (iii) 0.465 eV, which is roughly the 95% upper limit on 
^2 m v from the current generation of cosmological probes. For each of these fiducial models 
we generate mock data sets as per usual, but fit them using a modified linear transfer function 
that has the shape of the z = linear transfer function and an amplitude scaled by the k — > 
linear growth functions, i.e., 

9,8 V ' > G lm (fc^0,0) *> SK ' ; T l ™ s {k^ 0,0) *' 5V ; V ' 

Nonlinear corrections (which also induce a scale-dependence in the effective growth functions) 
are computed with Halofit using this modified linear transfer function as an input. If linear 
spectral shape evolution does indeed constitute an observable effect, it will manifest itself in 
the fit as a bias in the parameter estimates and/or a degradation of the best-fit xls- 

Figure 7 shows the two-dimensional joint marginalised 68%- and 95%-credible contours 
in the (h, ^ m l/ )-subspace from this exercise, computed using the most stringent "csgx" data 
set. These two parameters suffer the strongest bias when linear spectral shape evolution has 
been deliberately ignored. Nonetheless, unless Yl rn u 1S close to or beyond its present-day up- 
per limit, the deviation induced is considerably below la; even in the case of ^ = 0.465, 
we do not expect the parameter bias to exceed ~ 1.5<r. 

A somewhat stronger effect can been seen from an examination of each model's goodness- 
of-fit relative to that of the true underlying model. For the three scenarios considered here, 
we find, in ascending order of fiducial neutrino masses, A x 2 oS = L92 ' 2 - 89 ' 7 - 68 - Thus ' in the 
very optimistic case of a true ^ m u that is as large as is presently allowed by observations, a 
model comparison analysis may yield some mild evidence for the time evolution of the linear 
spectral shape. These numbers also indicate that spectral evolution due to neutrino free- 
streaming plays but a negligible role in the constraint of the neutrino mass by a EuCLiD-like 
survey. 

6.6 Modelling of the neutrino mass spectrum 

In our analysis so far we have always assumed one massive and 2.046 massless neutrinos. 
This mass spectrum is clearly inconsistent with both the normal (NH) and the inverted (IH) 
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Figure 7. Two-dimensional joint marginalised 68%- and 95%-credible contours in the (h, m y )- 
subspace from a "csgx" fit, where the analysis has been performed using the modified linear transfer 
function (6.1) which has no shape evolution. Black dots mark the fiducial values of ^2m u and h in 
each model. 



hierarchies stipulated by neutrino oscillation experimental data: 



mi < ni2 = mi + y Araj 2 < 771.3 = mi + y Amf 2 + y Am| 3 , NH, (6.2) 



7713 < mi = TO3 + J Am^ — y Amj 2 < m2 = m% + y Am^, IH, (6.3) 

where Amf 2 = 7.5 x 10" 5 eV 2 and |Am^ 3 | = 2.3 x 1CT 3 eV 2 [35, 36]. In view of the re- 
markable forecast sensitivities to m v presented in table 1 , one might wonder if our simple 
"1 massive+2 massless" scheme will remain an adequate approximation in the face of real 
Euclid data. 

The question of whether the neutrino mass spectrum could ever be resolved by precision 
cosmology has been discussed previously in many different contexts (see, e.g., [37, 44]). Here, 
we wish to establish (i) if using an approximate model would bias the parameter inference (in 
^2i m v ° r other parameters), and (ii) if an approximate model would provide a much worse 
fit to the data than the true model, so that the former could be excluded by way of model 
comparison. 

Our set-up is simple. Assuming a fiducial sum of m u = 0.11 eV, we generate two 
mock data sets, one with three non-degenerate neutrino masses distributed according to the 
normal hierarchy, and the second distributed according to the inverted hierarchy. Each of 
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Figure 8. Marginalised posterior probability density for the sum of neutrino masses from a "csgx" 
fit, where the analysis has been performed in the 1+2, 2+1, and 3+0 schemes. The fiducial model has 
Y rriv =0.11 eV (marked by the dotted vertical line) and assumes the normal (left) and the inverted 
(right) hierarchy, respectively. The shaded area indicates the minimal 68%-credible interval for the 
worst-fitting scenario. 



these two sets is analysed using three different approximate models for the neutrino mass 
spectrum: 

• "1+2": 1 massive state, 2.046 massless states 

• "2+1": 2 degenerate massive states, 1.046 massless state 

• "3+0": 3 degenerate massive states, 0.046 massless state 

The choice of ^ m v = 0.11 eV as the fiducial sum is motivated by the fact that it is roughly 
the minimum value allowed by the inverted hierarchy according to equation (6.3). 

Figure 8 shows the one-dimensional marginalised posteriors for m f from our six fits, 
using a combination of Planck CMB, galaxy (no bias marginalisation) , shear, and shear- 
galaxy cross-correlation data ("csgx", using the notation of table 1). Expectably, of the 
seven free parameters in our model, the estimation of Yl m v is most strongly affected by 
how we choose to model the neutrino mass spectrum. Even so, the bias in the inference of 
Y2, m v caused by a bad model choice is at worst a la effect, one that we can easily live with. 
Likewise, in all cases, the goodness-of-fit as quantified by the model's best-fit x^s suffers only 
a degradation of at most A%^ ff = 0.9 relative to the true model, indicating that the data show 
no strong preference for any particular neutrino mass spectrum. We therefore conclude that 
even under the best circumstances, the combination of Planck and Euclid data will not 
be able to probe directly the neutrino mass hierarchy, and hence, an exact modelling of the 
neutrino mass spectrum for the purpose of cosmological parameter inference is unnecessary. 

Finally, we observe in figure 8 that for both the normal and the inverted hierarchy, 
the 2+1 scheme appears to be the most unbiased approximation. While this makes perfect 
sense for the inverted hierarchy given the fiducial sum of Yl l m v = 0.11 eV, we note that 
in the normal hierarchy, this fiducial model in fact corresponds to one state with a 0.05 eV 
mass, and two almost degenerate, slightly lighter states at 0.03 eV. The best approximate 
mass spectrum should therefore lie somewhere between the 1+2 and the 3+0 models. The 
2+1 scheme happens to satisfy this "in-between" requirement and hence provide an almost 
unbiased fit to the mock data, although strictly speaking it is a very poor approximation to 
the underlying physics. 
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Figure 9. Forecast two-dimensional joint marginalised 68%- and 95%-crcdible contours in the 
(u) m , h, m u, iu)-subspace of the 9-parameter model (equation (7.1)). Black: Planck CMB 
data+galaxy auto-spectrum (11 bins) without bias marginalisation. Red: Planck CMB+shear auto- 
spectrum (2 bins). Green: PLANCK+galaxy auto-spectrum+shear auto-spectrum+shear-galaxy cross- 
correlation. 



7 More complex cosmological models 

So far we have derived our constraints on neutrino masses and other cosmological parameters 
in the context of a simple 7-parameter model, namely, a ACDM cosmology extended with 
one additional parameter ^2m u . However, previous studies have shown that Y^ m v, as well 
as the physical matter density lo and Hubble parameter h, can be highly degenerate with 
physics not encompassed by this simple 7-parameter model, for instance, a nonstandard value 
for the effective number of massless neutrino species N!3f, or a dark energy equation of state 
parameter w that deviates from —1 (see, e.g, [45-49]). It is therefore interesting to ask if 
the extremely tight constraints on oj m , h and m v reported in section 6 would survive in a 
more complex model parameter space. 

To test this, we perform a forecast for a 9-parameter model described by 

@9 = (wb, "dm, h, A s ,n s , z m , f u , w), (7.1) 

using the "csgx" data set. We choose the fiducial values 

8 g fld = (0.022, 0.1126228, 0.7, 2.1 x 10~ 9 , 0.96, 11, 0.00553, 2.046, -1), (7.2) 

i.e., the same fiducial values as those used earlier in our 7-parameter fit (see equation (5.2)), 
with the standard model prediction for iV^ 1 and dark energy in the form of a cosmological 
constant. Note that we assume, as we did before, one massive neutrino species; only the 
number of massless species is now free to vary. 
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Table 2. Posterior standard deviations for the parameters Wdm, h, ^m„, Njjg, and w in the 9- 
paramctcr model, derived from the "cs", "eg", "csgx" and "csgbx" data sets. For easier comparison 
we tabulate also the corresponding values from the "csgx" and "csgbx" sets in the 7-parameter model. 



Data 


10 3 x o-(w dro ) 


100 x a(h) 


cr(X]»n„)/eV 




<t(w) 


cs 


1.864 


0.638 


0.064 


0.081 


0.0310 


eg 


1.121 


0.655 


0.020 


0.086 


0.0163 


csg 


0.885 


0.324 


0.012 


0.056 


0.0083 


csgx 


0.874 


0.292 


0.012 


0.055 


0.0065 


CSgb 


1.400 


0.529 


0.042 


0.068 


0.0207 


CSgbX 


1.390 


0.482 


0.042 


0.068 


0.0186 


csgx (7-parameter) 


0.181 


0.071 


0.011 






csgbx (7-parameter) 


0.354 


0.244 


0.022 







Figure 9 shows the correlations between mv, tJ m , h) and the two new parameters. 
A strong three-way degeneracy persists between (u m , h, N^), since these are the main pa- 
rameters that control the epoch of matter-radiation equality and the CMB angular sound 
horizon (see section II of [50] for an explanation). This three-way degeneracy can lead to 
an even stronger apparent degeneracy between (w m , h, ^ m v ) (and, by association, between 
m u and N"g) in the matter power spectrum, although the degeneracy is not an exact one 
and once shear and galaxy data are used in combination, only a small correlation remains 
between these parameters. However, while the (ui m , h, A^)-degeneracy does not translate 
into a significant degradation in the sensitivity to the neutrino mass, it does nonetheless turn 
the mildly negative correlation between cu m and m v previously seen in the 7-parameter fit 
into a positive one. The (Y m„, w)-degeneracy, on the other hand, is completely broken be- 
cause of the combination of two independent determinations of the growth function G(k, z), 
which tightly limits the (w,j m , h, w)-degeneracy inherent in G(k,z). 

To check that it is indeed the combination of galaxy and shear data that saves the bound 
on m v in the extended model, we perform two separate forecasts using PLANCK+galaxy 
("eg") and PLANCK+shear ("cs") respectively, the results of which are shown also in figure 9. 
Clearly, removing either the galaxy or the shear data from the combined fit restores the strong 
degeneracy between uj m and ^ m ^> an d hence the degeneracies of J2m u with w and A^ 1 . 
The same trend can also been seen in table 2, where we list the posterior standard deviations 
from various cosmological parameters. In the 9-parameter model, the sensitivities to ^ m„ 
and w degrade both by a factor of five when galaxy data are removed, and by a factor of 
two and three, respectively, when shear data are discarded. This is in contrast with the 
7-parameter model, where the corresponding degradations in the m sensitivity are 200% 
and 50% respectively according to table 1. Thus, we conclude that the simple combination of 
galaxy and shear data is even more powerful for constraining the extended model parameter 
space than for the minimal 7-parameter model. Shear-galaxy cross-correlation, on the other 
hand, continues to play a marginal role in the parameter constraints; excluding "x" from the 
analysis deteriorates the sensitivities by no more than 10%. 

Lastly, we examine the impact of galaxy bias marginalisation on the parameter sensi- 
tivities in the 9-parameter model. Full marginalisation over an unknown linear galaxy bias 

4 The sound horizon depends also on the baryon density uJb- However, uib can be very well constrained by 
the CMB even-to-odd peak ratios, and is therefore effectively decoupled from the degeneracy problem. 
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means a complete loss of information on the normalisation and hence the growth function 
G(k, z) from the galaxy data, the result of which is a severe degradation in parameter sensi- 
tivities to (w,j m , h, w) because of their mutual degeneracy in G(k, z). As shown in table 2, the 
sensitivities of the "csgbx" data set to (wdm, h, w) degrade by a factor 1.5 to 3 relative to the 
numbers from "csgx". In contrast, the sensitivity to iVZSr suffers only a 20% setback, under- 
standably because N^g does not play a role in the late-time normalisation of matter power 
spectrum. For ^2m u , the unbroken (u^ m , h, u;)-degeneracy translates into a degradation of 
the la sensitivity to 0.042 eV, and reduces the detection significance to only 1.5<r should the 
true m u be at its most pessimistic value of 0.06 eV. Shear-galaxy cross-correlation plays 
essentially no role in any of these constraints. Nonetheless, we note that compared with using 
shear data alone ( "cs" ) , the addition of unnormalised galaxy data to the analysis still offers a 
20% to 30% improvement in most parameter sensitivities; shape information still contributes 
some constraining power. 

8 Conclusions 

In this work, we have carefully studied the capacity of a future photometric galaxy survey 
such as Euclid in terms of its constraining power on the neutrino mass. Several previous 
studies have also considered the idea of using either the cosmic shear or the galaxy clustering 
power spectrum from a EuCLiD-like survey in conjunction with CMB measurements from 
the Planck mission to probe the neutrino mass scale (e.g., [42, 44, 51-54]). Indeed, as a 
first test, we have confirmed that PLANCK+shear and PLANCK+galaxy power spectrum are 
individually sensitive to the neutrino mass sum m v a t the cr(%2 m v ) = 0.02 — > 0.04 eV 
level depending on our knowledge of the linear galaxy bias, compatible with earlier findings. 

An interesting interplay arises, however, when shear and galaxy clustering data are 
used in combination. Here, as we demonstrate in a new study, the parameter degeneracy 
directions in the shear and the galaxy data in the three-dimensional (w m , h, ^ m J/ )-subspace 
are fundamentally very different. This means that the two data sets, although seemingly 
measuring the same thing, are in fact complementary to one another, and when used in 
combination, conspire to lift each other's parameter degeneracies. For the neutrino mass 
measurement, this complementarity between the shear and the galaxy power spectra can 
lead to a significant improvement in the survey's sensitivity to m v- m ^ ne case °f complete 
knowledge of the linear galaxy bias, the sensitivity can potentially reach cr(^ m„) = 0.011 eV, 
which will constitute a 5.4<r detection of ^ m„ even if the true ^ m„ has the lowest value of 
~ 0.06 eV allowed by neutrino oscillation experiments in the normal hierarchy. In the opposite 
extreme, however, where no knowledge of the bias is available, the improvement anticipated 
in o~(^2 m v ) from combining shear and galaxy measurements is greatly diminished, mainly 
because the galaxy data no longer contain any useful information on the amplitude of the 
matter power spectrum. Nonetheless, a sensitivity of cr(^ m v ) = 0.022 eV, or a detection 
significance of 2.5er, can still be expected. 

We have also verified explicitly that a similar sensitivity of m v ) = 0.012 eV can be 
attained in a more complex cosmological model extended with two additional free parameters, 
the effective number of relativistic degrees of freedom and the dark energy equation of 
state parameter w, if the linear galaxy bias is exactly known. In the complete absence of this 
knowledge, however, the sensitivity to Yl m u deteriorates to 0.042 eV, thereby reducing the 
detection significance to 1.5<7 should the true m v be a pessimistic 0.06 eV. 
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Given these spectacular sensitivities, we have also examined if a EuCLiD-like survey 
might be able to resolve the neutrino mass spectrum, and hence distinguish between the 
normal and the inverted hierarchy in a direct way. Taking a fiducial cosmological model with 
~Y^m v = 0.11 eV and three non-degenerate neutrino masses distributed according to the 
normal or the inverted hierarchy, we find that parameter estimation, even under the most 
aggressive model and data assumptions, suffers no significant bias (less than la) when a 
grossly incorrect neutrino mass spectrum has been assumed in the statistical inference. The 
same is true also when we consider the goodness-of-fit of the wrong models; the degradation 
in XeB 1S always less than one unit relative to the true model. 

Thus, we conclude that although a survey like Euclid will very likely provide a pos- 
itive detection of neutrino mass at high significance, the exact nature of the neutrino mass 
spectrum remains out of its reach, unless the total neutrino mass is sufficiently low to rule 
out the inverted hierarchy. 
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A Parameter degeneracies 

In this appendix we examine more closely the degeneracies amongst the parameters iv m , h, 
and Yl m v in the context of restricted two-parameter and three-parameter fits wherein only 
these parameters or a subset of them are let to vary. As already discussed in section 6.1, 
these three are the only cosmological parameters in our seven-parameter model (5.1) for 
which the addition of galaxy and/or shear auto-spectrum measurements to Planck CMB 
data is expected to improve significantly their uncertainties. 

We begin with a two-parameter analysis varying only oj m and h, after which we extend 
to model space to include also a free ^2m u . 

A.l Two-parameter fits 
A. 1.1 CMB only 

The most well-measured feature in the CMB temperature anisotropies is the angular sound 
horizon 6 S = r s (z+) / Da{Q, z±). In the numerator, r s (z±) is the physical sound horizon at 
decoupling z+ ~ 1100, 



r s (z±) 



dr c s (r) 



/4 1 a(z ca , 
~ 3000W- zk In 



6 iO m 



eq j 



y/1 + R{Z*) + yjgg + R(Ze q ) 
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Mpc, 



(A.l) 



where R(z) = (3/ 'A)(oj^/uj-y)a(z) is the baryon-to-photon ratio, and 1 + z cq = (oj m /uj 1 )(l + 
Q.227^)- 1 , with w 7 = 2.4692 x 10" 5 , gives the redshift of matter-radiation equality. In the 
denominator, 
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Figure 10. Approximate principal degeneracy directions and forecast 68%- and 95%-credible contours 
for h and w m for the two-parameter model. All other cosmological parameters are kept fixed at their 
fiducial values. Black: Galaxy auto-spectrum (8 bins) without bias marginalisation. The straight line 
indicating the degeneracy direction has slope M = —21. Grey: Galaxy auto-spectrum data (8 bins) 
with bias marginalisation. The corresponding slope is M = 6.5. Red: Shear auto-spectrum data 
(8 bins), with M = 9.5. Gold: Planck CMB data, with M = -4.2. 



denotes the comoving angular diameter distance to the last scattering surface under the 
assumption of a flat spatial geometry. 

We adopt fixed values for Wb = 0.022 and N u = 3.046, and vary w m and h around their 
fiducial values of cj^ d = 0.1346228 and h M = 0.7. Assuming 6 S can be parameterised as 



Um/i" in the vicinity of h ), perturbing around this point gives 



d0 s 



dh 
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with coefficients 
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gives the slope of the degeneracy ellipse in the (x = uj m , y = /i)-plane. 

Figure 10 shows the posterior in the (w m , fo)-plane for various mock data sets. The de- 
generacy slope predicted in equation (A.6) matches exactly the degeneracy direction preferred 
by the mock Planck CMB data. 
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A. 1.2 Galaxy angular power spectrum 

Shape information only Full marginalisation over the linear bias as discussed in sec- 
tion 5.4 forces us to discard all normalisation information, including information on the 
linear growth of structures as a function of redshift. In this case, the only handle by which 
cosmological parameter estimation is possible is through the shape of the angular power 
spectrum, notably through the "shape parameter" k cq = a(z cq ) H (z cq ) , which determines 
primarily the location of the turning point of the 3D matter power spectrum. 

After projection onto i-space, the equivalent feature in the galaxy angular power spec- 
trum can be found at 

4 q = k cq D A (0, zi) K M f dz ^ (A.7) 



where we have used k cq ~ w m to obtain the last approximate equality. Equation (A.7) 
expresses a positive correlation between w m and h for all reasonable source galaxy redshifts Z{. 
Setting the source redshift explicitly to Zi = 0.8 (i.e., the median redshift of the Euclid 
survey) and using the power-law approximation £ cq = ujmh u , we find [/, ~ 0.86 and v ~ —0.71 
in the vicinity of the fiducial model. These numbers corresponds to a slope of M ~ 6.3, 
which compares very well with M = 6.5 from our numerical fit shown in figure 10. 

Including normalisation information The role of normalisation information, especially 
at high multipoles, can be gleaned by a close inspection of the actual expression for the 
galaxy angular power spectrum. Supposing a source galaxy redshift distribution that is 
sharply peaked at Zi, the angular power spectrum can be broken down in the Limber limit 
into 

C !% ~ * H 2 { nSn Zl \ T ^ k = '/^(0, Zi)] Vn[k = £/D A (0, *)], (A.8) 

up to a cosmology-independent constant factor. Near scale-invariance of the primordial scalar 
perturbations renders Vn(k) an almost constant factor independent of the parameters u m 
and h of interest. We shall assume Vn(k) to be exactly constant in the following. 

Note that in writing down equation (A.8) we have explicitly split up the transfer func- 
tion Tq,(k,z) into a part that contains exclusively the ^-dependence, To(k), and one that 
encodes the redshift evolution, i.e, the growth function G{z). This splitting is in principle 
strictly valid only in the limit linear perturbation theory holds, and when gravitational clus- 
tering at low redshifts has no additional scale dependence (due to, e.g., hot dark matter, 
modified gravity, etc.). In ACDM cosmologies, the (linear) growth function G{z) can be 
written as 

k r°° 1 _|_ y' 

G(z) = ^u m H(z) J ^'jfAry (A.9) 

up to a cosmology-independent conversion factor between Hq and h. For To (A;), the well- 
known BBKS transfer function [55], 
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■[l+3.89g+(16.2g)^ + (5.47g) d + (6.71g) 4 ]- u -^, (A.10) 



suffices as a simple and convenient approximation of To(k). 

To evaluate the degeneracy direction in the (w m , /i)-plane, we use again the power-law 
approximation Cff - = Wmii" , and perturb around the fiducial model. Assuming the median 
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source redshift Z{ = 0.8 and i = £m\x — 400, we find /j, ~ 1.2, v ~ 0.29, and hence a 
degeneracy slope of M ~ —22, which matches closely M = — 21 found in the actual fit in 
figure 10. Interestingly, the inclusion of normalisation information now leads to a negative 
correlation between cj m and h, in contrast to the positive correlation we found earlier from 
shape information only. 

A. 1.3 Shear angular power spectrum 

Shear measurements do not suffer from tracer bias effects, and therefore always contain 
normalisation information. To understand the degeneracy in the (u) m , /t)-plane, we again 
assume a sharply peaked source galaxy redshift distribution at Zi, and break down the shear 
power spectrum in the Limber limit as 

2 

T$[k = £/D A (0, z)] V n [k = l/D A (0, z)}, 

(A.ll) 

up to a cosmology-independent constant factor. The expression can be further simplified if 
we assume (i) exact scale- invariance of the primordial perturbations, which reduces Vn(k) 
to an irrelevant constant, and (ii) that the gravitational lenses are all concentrated at one 
redshift z = Z\ em . This last assumption amounts to replacing all functional dependences on 
z in equation (A.ll) with a dependence on z\ cns , so that no z- integration is required. For 
Zi = 0.8, the choice of z\ ens = 0.5 should suffice. 

Assuming C| s ^ = i^li", we find [i ~ 2.8 and v ~ —2.5, so that M ~ 5.9 in the vicinity 
of the fiducial model. These numbers have been obtained assuming I ~ 1000, although other 
choices of £ yield virtually the same result for the slope M. In contrast to our previous 
results, this value of M is not a very good match to M = 9.5 from our numerical fits in 
figure 10. This mismatch is most likely due to our inadequate modelling of the nonlinear 
transfer function (the BBKS transfer function (A. 10) describes only the linear regime), which 
is important for cosmic shear at large angular multipoles. Despite this mild inconsistency, 
our result does demonstrate that the correlation between co m and h from shear alone is 
positive. This is interesting when compared with the negative correlation preferred by the 
galaxy angular power spectrum including normalisation, because naively one would expect 
the shear and the galaxy power spectra to be measuring the same thing (at least in simple 
ACDM cosmologies). The different normalisations and weights W/*" for galaxies and shear 
do apparently have a strong effect on cosmological parameter estimation. 

A. 2 From two to three parameters 

Adding to the fit a freely varying neutrino mass m v complicates the picture considerably, 
and precludes semi-analytic degeneracy analyses such as those presented in the previous 
section for the two-parameter case. One reason for this is that there are no simple approxi- 
mations to the transfer function that incorporate the effects of ^m v . Another is that it is 
not obvious how one would project a three-dimensional degeneracy onto two dimensions, so 
as to explain our three-parameter fit results in figure 11, without resorting to some sort of 
simulation. 

Instead, we present an a posteriori plausibility argument. Two plots in figure 12 show 
samples with < 2 in (u m , h, ^ m Jy )-space from the Markov chains of our three-parameter 
fits of the galaxy and the shear power spectra, respectively. The galaxy power spectrum fit 
does not marginalise over the linear galaxy bias. The horizontal axes in the plots are always 
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Figure 11. Marginalised joint two-dimensional forecast 68%- and 95%-crcdible contours for h, us n 
and ^2 m v for the three-parameter model. The colour scheme is the same as that in figure 10. 



aligned with the /i-direction, while the boxes have been rotated around the /i-axis in the 
(w m , ^2 mi/)-plane in such a way that all samples appear to lie on as thin a line as possible. 
The direction perpendicular to the line therefore corresponds the parameter direction that is 
the most well-constrained. 

To quantify the rotation, we note that it is in fact equivalent to defining an effective 
parameter w e g- that is a linear combination of u m and ^ m v . As shown in figure 12, shear and 
galaxy measurements require different amounts of rotation, and hence also different effective 
parameters u e g. Phenomenologically, we find that 

Wefr.g = w m - 5.0 oj v , (A.12) 
cj effjS =u m - 3.0 u v (A. 13) 

work well for galaxies and shear respectively. Furthermore, once these rotations have been 
performed, the degeneracies between (w e fr,g, fr), and between (io e s iS ,h) become essentially 
identical to those between (u m ,h) in the two-parameter fits presented in figure 10. This 
observation indicates that it is the effective parameters w e ff ig and a; e fr,s to which galaxy and 
shear measurements are primarily sensitive, not u m and m f individually. 

Looking back at the two-dimensional marginalised posteriors in figure 11, we now see 
that the widening of the (w n , /^-ellipses in the case of shear is simply a consequence of the 
measurement being sensitive to w c fj >s rather than to w m itself, the latter of which can now 
take on a wider range of values because of a freely varying m u ■ In the case of the galaxy 
power spectrum, the positive correlation we now observe between u m and h in the three- 
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h 




Figure 12. Three-dimensional scatter plots of samples with < 2 from the Markov chains of our 
three-parameter fits. Black: Galaxy auto-spectrum only. Red: Shear auto-spectrum only. The boxes 
have been rotated in the (w m , m„)-plane such that all samples appear to lie on as thin a line as 
possible. 

parameter fit can also be justified: a negative correlation still exists between w e g jg and h (as 
it did between ui m and h in the two-parameter fit), but co m itself can be positively correlated 
with h, so long as ^ m u can be adjusted accordingly. We cannot however prove that this 
nipping of the degeneracy direction must occur. 
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